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MEASUREMENT OF GROWTH AND EFFICIENCY 
IN ARITHMETIC (Concluded) 



S. A. COURTIS 
Home and Day School, Detroit 



Of equal importance with the question of the amount of 
growth produced by a year's work is that of the permanence 
of the growth. Table VI gives for Grades 4 to 9 the average 
scores in the speed tests in September, 1909, the highest scores 
made at any time during the year, and the scores of the same 
classes the following September after the long summer vacation. 
From these the gains during the year, the losses during the sum- 
mer can easily be computed. The last two columns show the 
differences between the September tests and the percentage of 
improvement. 

The results are striking and suggestive. The large gains of 
the fifth and sixth grades are offset by the large losses during 
vacation, while the growth of the seventh and eighth grades is 
seen to be relatively permanent. If these results are confirmed 
by future work, the seventh and eighth grades would seem to 
be the place in the course where rigid drill on the combinations 
would be most effective. 

At the beginning of the year it was felt by all concerned that 
there was one question that the year's work must settle, and 
settle very definitely. Do such short, simple, comparative tests 
yield reliable results, or would repeated tests give very different 
relative standings? There is not space here to marshal all the 
evidence that proves the grade averages reliable measures of 
grade abilities, but an examination of the results in Table V 1 
from this point of view will show how consistently grade after 
grade has obtained the same score in successive tests. It is to 
be remembered that all these grades were subject to constant 
practice in their daily work and drill, so that it is only the be- 
fore-and-after-vacation scores, or successive scores during a 

"See Elementary School Teacher (March, 1911), 367. 
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"plateau" that can be expected to be constant. Such scores are, 
however, remarkably constant. Extended practice, as has been 
shown, does produce a rise in score, yet but a part of this is 
permanent, and the abilities in question, slowly built up by hours 
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of study and drill, must be considered relatively stable quan- 
tities. The constancy of very many grade and individual scores 
in repeated .tests proves conclusively that the results obtained 
from even a single test carefully conducted are reliable measures 
of grade ability. 
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On the other hand it is important to state that this is not 
true of the scores of the individuals composing the grade. Even 
when the grade average is perfectly constant, individual scores 
are found to vary. This is shown clearly in Table VII, the in- 
dividual records in the addition speed test of 23 of the 31 mem- 
bers of the eighth grade. The records omitted are those in 
which through absence the series was not complete. The graph 
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Fig. 1 1. — Gains and losses for the year. Broken line shows maximum gain — from 
September, 1909, to highest point reached at any time during year. Dotted line shows 
effect of summer vacation — loss from highest point reached to September, 1910. Full 
line shows net gain for year, September, 1910, over September, 1909. 

(Fig. 12) makes possible the rapid visual comparison of the 
changes in the individual scores with those of the grade average, 
the heavy central line. It will be seen that at such times as the 
grade average is constant, the constancy is due to the fact that 
the variations of certain individuals in one direction are offset 
by the variations of others in the opposite direction. For the 
individual, therefore, repeated tests are needed to determine 
his most frequent score and the range of his variation from it. 
By far the most important fact to be gained from a study 
of Table VII is the wide range of ability within the grade. The 
score of the best girl is on the average twice that of the poorest 
girl, in spite of the fact that the individuals in this class repre- 
sent the survivals of eight years of "passing" on a 70 per cent 
basis. The graph shows plainly that the grade average is really 
representative of very few scores, and that a measure of the 
range of variation is needed in addition to the measure of the 
average performance. The writer has chosen the average devia- 
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GROWTH AND EFFICIENCY IN ARITHMETIC $33 

tion, the average difference of the individual score from the 
grade average, as the measure of variability, and for purposes 
of comparison the average deviations will be expressed as per- 
centages of their respective averages. The significance of the 
measure is shown graphically in the plot. On each side of the 
heavy line representing the grade average, there has been drawn 
a heavy broken line at a distance from the average equal to that 
of the average deviation. Between the two broken lines will 
be found roughly 50 per cent of the class. That is, the excep- 
tionally weak and the exceptionally able have been excluded, 
and the upper and lower limits of ability of the remainder of 
the class are given by the broken lines. Because of this wide 
individual range, therefore, the ability of a class is better 
thought of as a "zone" than as a single average score. A nar- 
row zone indicates a compact well-graded class, the product of 
an efficient system of promotion. Unfortunately the zones from 
the present tests are wide, wider at least than would seem to the 
writer to be expected in any school that prides itself on its 
standards, and he would hesitate to publish such results did not 
returns from other schools using the same tests under the same 
conditions show zones of the same or greater width. It seems 
probable that the condition is general in education at the pres- 
ent time. Within the school other tests and measurements make 
comparisons possible. The variability of the same 23 individual 
girls whose scores in addition were given in Table VII, and 
whose variability was there shown to be 18 per cent, was 3 per 
cent in age, 7 per cent in handwriting (samples measured by 
Thorndike's scales), 15 per cent in formal English grammar, 26 
per cent in height, 30 per cent in memory of important dates 
and men, 60 per cent in ability to reproduce after 24 hours the 
main points of a historical passage read once in class. 

No one result of the testing work has proved more disturb- 
ing to the writer, more startling in its implications, than this 
extreme variability found in all the grades. Its full significance 
can be appreciated only by one who has made a careful study 
of the individual records and then noted the behavior of the 
same individuals in the classroom. The writer believes weak- 
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ness and inefficiency are due more to failure to secure complete, 
all-around development in the fundamental abilities of a sub- 
ject, and standard growths in each grade, than to any other one 
cause. Just how inefficient the present conditions are is shown 
in the case of knowledge of the multiplication tables by the re- 
sults in Table VIII and Fig. 13. The table gives the average 
and the average deviation for each of the four grades in all the 

TABLE VIII 
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multiplication tests of the year. The graph shows the zone of 
ability of each grade shaded in such a way that ground common 
to two or more grades can be detected at once. The over- 
lapping of grade upon grade is such that for two-thirds of the 
year a part of the territory is common to all four grades, while 
a large part of it is at all times common to three grades. The 
table and figure together make very plain the complete break- 
down of the grade idea. For it is to be remembered that the 
zones do not include the extremes, but the central core of the 
class only. In other words, the variations of individual ability 
within the grade are far greater than the increase of knowledge 
or ability from grade to grade. It will be seen, therefore, that 
just so far as these conditions are truly general, the problem of 
the degree of variability that can be permitted within a grade 
is a vital one for graded systems. 

The cause of such breakdown of grade standards is to be 
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found in the inherent differences in individual children. Con- 
sider the effect of vacation, for instance. In Fig. 12 it will be 
seen that as a whole the grade score fell slightly during the Christ- 
mas vacation and the variability increased, while the opposite 
was true of the spring vacation. Individuals 2, 14, and 21, how- 
ever, are consistently demoralized by both vacations. Individual 
6, on the other hand, shows just the opposite effect, while some 
of the others are very slightly affected if at all. One looks in 
vain for any constant effect during the year. The test the last 
week in May is particularly interesting as bearing on this point. 
It represents the trial of a new test, similar to the old but better 
balanced as to units. To many children it proved much easier, 
but there were individuals in every grade to whom it seemed 
more difficult, and in the case of the eighth grade the variability 
was correspondingly increased. From the point of view of 
the year's growth the same thing is evident. The grade work 
that enabled No. 12 in rank in September, 1909, to become No. 
4 in September, 191 o, and No. 18 No. 12, changed the relative 
rank of other members of the grade very little, while there are 
a few whose absolute score also shows but little improvement. 
In other words, education is an individual matter, and no grade 
instruction nor group work can possibly be devised to minister 
adequately and efficiently to the widely differing needs of any 
group selected under the present system. 

The writer, however, is not one of those who believe that 
the needs of the individual will bring about the abolition of our 
present graded system. The necessity for handling large num- 
bers of children and the many benefits that accrue from group 
work make its continuance a certainty. At the same time the 
differences in individuals make it imperative that each child 
receive the kind and amount of instruction that his particular 
mental makeup demands. These two conditions can be met by 
having for each subject and each grade standard scores and 
standard growths in all those component abilities that are funda- 
mental. Individual measurement by standard tests and con- 
stant checking of the results of classwork, supplemented by 
individual prescriptions of study and work, would insure the 
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attainment of any desired degree of minimum ability, and still 
leave room in the nonessentials of the subject for all those per- 
missible individual variations which make for personality. If 
education is to become either scientific or efficient, more atten- 
tion must be given to accurate determinations of both the ma- 
terial to be acted upon, and the effects produced. In the future 
it may well be that a testing department will be a vital part of 
every well-organized school system. 

One other question .will briefly be discussed, the correlation 
between the elemental abilities and the complexes of which they 
form a part. The question of how far drill on the multiplication 
tables, for instance, must be carried in order that a child may be 
able to multiply correctly in the solution of problems is one of 
some importance. Although in all discussions based upon the 
present tests, the defects of the tests themselves and the conse- 
quent possible unreliability of the results must be kept in mind, 
two correlations attempted seem to yield results of value. 

The first is the relation between rate of motor activity and 
total score in the speed tests. It seemed reasonable to suppose 
that a child able to copy figures more rapidly than another 
would obtain higher scores in writing the combination, even 
though the two children really had equal knowledge of the 
tables. To test this point the June scores of 182 individuals of 
all grades above the third were thrown into one large group 
and distributed on the basis of their scores in copying figures. 
The average total score in each of these smaller groups was 
then determined. Figure .14 shows the results graphically. It is 
evident from the curve that there is a very close connection be- 
tween the two types of scores up to a certain point, but that 
beyond this point the ability to copy figures with extreme rapid- 
ity does not carry with it the corresponding ability to obtain a 
high score in the other speed tests. This critical score, there- 
fore, approximately 1 1 5 figures in a minute, has been taken by 
the writer as the standard to be reached before the work in 
arithmetic is completed. Standard scores, graded down from 
this' maximum, have been set for the other classes below the 
eighth, and the attempt is being made by classwork and by 
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Fig. 14. — Correlation between speed in copying figures and total scores in the 
speed tests. 




Fig. 15. — Correlation between total scores in the speed tests and the number of 
examples attempted and right in the test on fundamentals. 
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extra special individual work to bring the score of every indi- 
vidual up to the score for her grade. By thus securing a stand- 
ard growth in each grade, the final maximum ability will be 
insured. 

A second correlation of some importance is the relation be- 
tween total scores in the speed tests and the scores in the tests 
on abstract examples. Figure 15 gives the results both for ex- 
amples attempted and for examples right. The general form 
of the curves is the same. That is, there is probably a critical 
degree of knowledge of the tables to be attained by drill be- 
yond which an increase in score does not carry with it an in- 
crease in ability to work examples. The determination of the 
proper balance between these fundamental abilities of a sub- 
ject affords a large number of important problems for experi- 
mental work. 

No general summary of the points made in the foregoing 
discussions will be attempted. The writer feels, however, that 
the work of the year has proved conclusively the necessity of 
measurements of individual needs and individual growth, as 
well as the great possibilities of comparative tests from the 
standpoint of intelligent determination of courses and methods. 
It has also made possible the construction of tests composed of 
units of known values, so that future measurements will be 
free from this source of error. The immediate work of the 
future is, therefore, the determination of standards for each 
grade and of standard growths. It must also settle the question 
of whether or not the teacher will be able, by use of the knowl- 
edge gained from the tests, to bring individuals up to standard 
scores. It is with these questions that the experimental work 
of the school during the coming year is to deal. 



